Generative AI

May 14, 2024

Generate Text Responses from Visual and Text Inputs with Google's New PaliGemma Model

With free NVIDIA cloud credits, you can start testing the model at scale on the API Catalog.

1 MIN READ

May 14, 2024

NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support

NVIDIA today announced the latest release of NVIDIA TensorRT, an ecosystem of APIs for high-performance deep learning inference. TensorRT includes inference...

7 MIN READ

Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 2.

May 13, 2024

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 2

In the first post, we walked through the prerequisites for a neural machine translation example from English to Chinese, running the pretrained model with NeMo,...

11 MIN READ

Decorative image of a globe surrounded by people speaking and texting in different languages, with the text Part 1.

May 13, 2024

Customizing Neural Machine Translation Models with NVIDIA NeMo, Part 1

Neural machine translation (NMT) is an automatic task of translating a sequence of words from one language to another. In recent years, the development of...

8 MIN READ

May 13, 2024

Regional LLMs SEA-LION and SeaLLM Serve Languages and Cultures of Southeast Asia

At the recent World Governments Summit in Dubai, NVIDIA CEO Jensen Huang emphasized the importance of sovereign AI, which refers to a nation’s capability to...

3 MIN READ

May 12, 2024

Enabling Quantum Computing with AI

Building a useful quantum computer in practice is incredibly challenging. Significant improvements are needed in the scale, fidelity, speed, reliability, and...

6 MIN READ

Decorative image of multimodal RAG workflow.

May 12, 2024

Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing

In the rapidly evolving field of software development, AI tools such as chatbots and GitHub Copilot have significantly transformed how developers write and...

8 MIN READ

May 10, 2024

Dynamic Control Flow in CUDA Graphs with Conditional Nodes

CUDA Graphs can provide a significant performance increase, as the driver is able to optimize execution using the complete description of tasks and...

7 MIN READ

May 08, 2024

Amdocs Accelerates Generative AI Performance and Lowers Costs with NVIDIA NIM

Telecommunications companies (telcos) are leveraging generative AI to increase employee productivity by automating processes, improving customer experiences,...

10 MIN READ

May 08, 2024

Accelerate Generative AI Inference Performance with NVIDIA TensorRT Model Optimizer, Now Publicly Available

In the fast-evolving landscape of generative AI, the demand for accelerated inference speed remains a pressing concern. With the exponential growth in model...

9 MIN READ

May 08, 2024

Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints

Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more...

13 MIN READ

Image of a gridded cube with purple and green dots.

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

Decorative image of VILA and Jetson Orin workflow.

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...

11 MIN READ

May 01, 2024

Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD

With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...

5 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ